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jrt ' Abstract. Problems related to the existence of integral and rational 

points on cubic curves date back at least to Diophantus. A signifi- 
cant step in the modern theory of these equations was made by Siegel, 
r~^ ' who proved that a non-singular plane cubic equation has only finitely 

fSJ I many integral solutions. Examples show that simple equations can have 

inordinately large integral solutions in comparison to the size of their 
coefficients. A conjecture of Hall attempts to ameliorate this by bound- 
ing the size of integral solutions simply in terms of the coefficients of the 



H , „ ^., 

f^ ^ • defining equation. It turns out that a similar phenomenon seems, con- 



^ 



jecturally, to be at work for solutions which are close to being integral 
in another sense. We describe this conjecture as an illustration of an 
2 ' underlying motif - repulsion - in the theory of Diophantine equations. 



m 

>• ' 1. Challenging Questions 

^n : 

In 1657, Pierre de Fermat challenged the English mathematicians Sir 
Kenelm Digby and John Wallis to find all the integer solutions to the equa- 
tion 
^O ■ „,2 , o _ ^3 



o 



o 



y^ + 2 = x-\ (1) 

^P . We can make an educated guess about his motivation. 

Diophantus [331 Book VI, Prob. 19] asked for a right triangle, the sum 
of whose area y and hypotenuse h is a square, and whose perimeter is a 
^ , cube. Taking the legs to be 2 and y, and h + y to he 25, he noted that the 

H I square 25, when added to 2, gives the cube 27. The two equations h'^ = 2/^+4 

and y+h = 25 give y = ^ as a solution to the problem. This is pleasing, but 
we are more interested in the intermediate observation, namely a solution 
in integers to the Diophantine equation ([1]). 

Bachet, in his famous 1621 Latin translation of the Arithmetica of Dio- 
phantus, noted that from the one solution 5^ -|- 2 = 3'^ other rational num- 
bers r may be found with the property that r^ -|-2 is a cube. Fermat acquired 
a copy of this work, and by 1636 had studied it carefully and made significant 
advances. He also recorded several of his most influential marginal notes in 
this book, including the comment concerning 'Fermat's last theorem'. 
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In 1656 the English mathematician Wallis, at the time chief cryptographer 
to Parhament, published his Arithmetica infinitorum. Digby brought this to 
the attention of Fermat, prompting Fermat to begin a correspondence with 
'Wahis and other English mathematicians' concerning some of his number 
problems. This correspondence all passed through Digby's hands, and thus 
in 1657 Fermat challenged Digby and Wallis: 

Pent on trouver en nombres entiers un carre autre que 25, qui, 
augniente de 2, fasse un cube? A la premiere vue cela pardit 
d 'une recherche difficile, en fractions une infinite de nombres 
se deduisent de la methode de Bachet; mais la doctrine des 
nombres entiers, qui est assurement tres-belle et tres- subtile, 
n'a ete cultivee ni par Bachet, ni par aucun autre dont les 
ecrits venus jusqu'd moi. 

That is, to find all the integer solutions to the equation ([T|). 

The solution (x, y) = (3, 5) noted by Diophantus is one, and Digby and 
Wallis might have agreed it is the only one, since negative numbers had 
yet to enjoy a fully equal status with their positive siblings. Presumably 
the challenge was to show that, sign apart, there are no others, as Fermat 
claimed a proof that this solution is unique. It is not clear from this distance 
in time if any of the three protagonists ever did have a complete proof of 
this. 

1.1. What if 2 had been 15? What might have been the reaction if Fermat 
had instead challenged the English mathematicians with the equation 

y"^ = x^ + 15? 

The solution (1,4) is easy to spot. It is less easy to notice that (109, 1138) 
is also a solution - and not at all obvious that, issues of sign aside, there are 
no others. 

Even more challenging, what about the equation 

y^ = x^ + 24? 

The solution (1, 5) is once again easy to spot, as is the solution (—2, 4), and 
it is not too difficult to find a third solution (10, 32). However, life is surely 
too short to find the solution (8158, 736844) without mechanical help. Once 
again, up to sign, this is the full list of solutions. 

As these examples show, finding the solutions to the Diophantine equa- 
tion y^ = x'^ + d depends in a rather unpredictable way on the constant d. 
Among the many results on this problem, some stand out. Euler showed 
that there are no positive rational solutions apart from (2, 3) when d = 1 
using the method of descent, and a long series of other special cases were 
solved by many other mathematicians (see Dickson [U Chap. XX] for the 
details). A snapshot of the state of knowledge on this question prior to 
Siegel's theorem may be found in the work of Mordell [21], where many but 
not all values of d for which there are no integral solutions are found. 
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Nowadays the numerical facts above - not only the stated solutions, but 
the much deeper claim that there are no other integral solutions - can be 
checked easily using one of the many sophisticated computational packages 
available, such as Magma [4]. This is greatly to be celebrated, although the 
facility with which these calculations can now be done risks obscuring the 
remarkable achievements made in Diophantine analysis over the last forty 
years that have made this possible. It is not our intention to survey these 
achievements (a detailed overview may be found in the lovely monograph 
of Hindry and Silverman |18j). although some highlights on the theoretical 
side will appear naturally. 

1.2. Satisfying Fermat. Out of a patriotic desire to satisfy Fermat, we 
start with a (now elementary) proof that (3, ±5) are indeed the only in- 
tegral solutions to y^ = x^ — 2. The language of (and one result from) 
modern algebra makes this straightforward. Plainly x must be odd, for 
otherwise y'^ = 2 (mod 4), which is impossible. The ring R = 'L\\/ —'2\ is a 
unique factorization domain (indeed, is a Euclidean domain - see Hardy and 
Wright [16^ §14.7] for an account) - whose only units are ±1. Factorizing 
there gives 




In the ring i?, gcd(y + ^/—2,y — -y/— 2)|2\/— 2. Since x is odd, the greatest 
common divisor is a unit, and so must be ±1. It follows that each factor is 
a unit multiple of a cube in R. Write (absorbing —1 = (—1)^ if necessary) 

y + \/^ = (a + h^/^f 

with a,b £ Z. Comparing coefficients of \/— 2 gives 

1 = 30^6 - 26^ = 6(3a^ - 26^) 

which forces b = 3a^ — 2b^ = ±1. Solving gives 6=1 and a = ±1, and the 
two choices give y = =f5 and x = 3. 

2. Siegel's Theorem 

The following is a special case of a wonderful and deeply influential result, 
proved by Siegel. Siegel's result was more general, but for the case at hand 
may be stated as follows. 

Theorem 2.1 (Siegel f3Dj, 1929). Given an integer d ^ 0, the equation 

y"^ = x^ + d (2) 

has only finitely many integral solutions. 

This specific result was known earlier. Mordell [23j reported to the Lon- 
don Mathematical Society in 1918 that an earlier result of his [22], in con- 
junction with Thue's work [Mj, would show Theorem 12.11 Once again there 
is an important letter, sent by Siegel to Mordell in 1925 (an extract appears 
in [29]), outlining Siegel's ideas which eventually gave the general result [30] 
that any non-singular cubic curve has only finitely many integer points. In 
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honour of Mordell's contribution to this subject, the equation ([2]) in Theo- 
rem [2]T] is often known as Mordell's equation. 

Even in its simplest form, Siegel's proof was recognized as non-effective: 
a term whose meaning will be discussed shortly. Siegel gave a second proof 
of finiteness using unique factorization, in a manner very similar to the 
proof in the case d = —2 above, by working in a suitably large ring and 
then reducing to a number of simpler equations called S'-unit equations. 
Later this led to an effective proof, following Alan Baker's seminal work 
on transcendence theory. The terms effective and non-effective, although 
ubiquitous in number theory, are never precisely defined. 

effective (adj.): adequate to accomplish a purpose; producing 
the intended or expected result; 1350-1400; ME fr. L. effec- 
tivus = practical. 
In a nutshell, still lacking a mathematical definition, one might expect that 
an effective proof is one which produces an algorithm to implement the 
conclusion of a theorem. For example, in our context, an effective proof 
might consist of a bound on the size of the largest solution. This would 
allow - in principle - all solutions to be found, by simply checking integers 
below that bound to see if they satisfy the equation. This sounds desirable 
and practical, but there are examples where the gap between what theory 
provides and what is practical remains large. 



Theorem 2.2 (Baker [T], 1968). Any integral solution of ^ satisfies 

\x\ < e 



,ioio|d|io 



How large is the number on the right hand side? It is said that the number 
of electrons in the visible universe does not exceed 10^*^, while those on the 
earth amount to a little over 10®". Each of these quantities is minuscule 
compared to the bound in Theorem 12.21 even for modest values of d. It 
is tempting to imagine that with the speed of modern computers, it would 
nonetheless be practical to check values of x below that bound, giving a 
complete solution for any given d of reasonable size. Before attempting this, 
it is wise to think through some realistic estimates for the type of computer 
one might have at home. When d = —2, you will probably find it takes under 
a minute to check if x satisfies ([2]) for |a;| < 10®, over an hour to reach 10^. 
There are roughly 10^ personal computers in the world (and this figure is 
likely to double by 2015). If you owned them all and used them in your 
checking, it would take at least a billion billion billion billion billion billion 
years to check the equation for values of x with |x| < 10^*^, and this is well 
short of the bound in Theorem 12. 2[ Even checking Euler's proof that x = 2 
gives the only positive integral solution to y'^ = x^ -\- 1 is not feasible using 
Baker's bound. 

So, despite the etymology, effective as used in number theory (which 
would certainly include the statement in Theorem l2.2|) does not always mean 
practical. 
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Of course Theorem 12.21 was never intended to be a practical tool, and 
was never claimed to be such. It merely says that with the methods then 
available, this is the best bound obtainable, and sets a challenge for future 
generations. Dramatic improvements followed fairly quickly. For example, 
Stark |32] shows that 

max{|x|,|y|} < e^l'^''^", 
where C = C{e) is an effectively computable constant depending only on e. 
This is a considerable improvement upon Baker's bound, however, the size 
of C(e) is necessarily large, and checking small values is not computationally 
feasible, even for small values of \d\. 

Despite the power of these results and the direction of research initiated by 
the work of Baker and Stark, the modern computational facility for solving 
Mordell's equation came about not by further reducing the size of the bounds 
above. The method is actually less direct (see the work of Gebel, Petho, 
and Zimmer (13] for complete details about the toolkit now used). In a 
later paper [13], the authors show how this method resolves the equation 
for \d\ ^ 10^, and for almost all d with \d\ ^ 10^ 

Today, after major theoretical and computational improvements, com- 
puter packages will now find all the integral solutions of equations y^ = x'^+d 
provided d lies within reasonable bounds. To give an idea of what counts as 
reasonable it is worthwhile doing some experiments yourself. 

3. Hall's Conjecture 

To reiterate, solving Mordell's equation in practice does not rely upon 
obtaining a very strong upper bound for the size of the largest solution, 
then checking smaller solutions on a computer. In fact an important, simple, 
natural, question remains unsolved to this day. 

Question 1. What is the best theoretical bound (in terms of d) for the size 
of the largest integer x solving y^ = x^ + d? 

The truth is that the best known bound is far from what seems likely 
to be true. What seems likely to be true is the subject of the following 
conjecture made by Hall [15], which we will now discuss. Hall's conjecture 
has been subject to extensive numerical checking, but a proof seems to 
require dramatically more powerful methods than those currently available. 

Conjecture 1 (Halo). Given e > 0, there is a constant C = C{e) such 
that, for any non-zero d G Z, any integral solution of y^ = x^ + d satisfies 

log|x| < {2 + e)log\d\ +C. 

The audacious nature of the conjecture is not immediately apparent. Our 
second and third examples {d = 15 and d = 24) show that simple equations 
with small coefficients can have enormously large integral solutions. What 



Originally, Hall [15] conjectured the same bound but with e = 0. This is no longer 
thought to be likely. 
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Hall's conjecture suggests is that, when properly calibrated, the phenomenon 
of large integral solutions of an equation with small coefficients is not beyond 
constraint. 

The conjecture of Hall follows from the infamous ABC conjecture, for- 
mulated by Masser and Oesterle [26] in 1985, about the relative sizes of a 
zero sum of three integers. Write 



r{N) 



Up 

p\N 



for the radical of an integer N. The ABC conjecture says that for any e > 
there is a constant C = C{e) such that, whenever 

A + B + C = 

in non-zero coprime integers A, B, C, we have 

max{|A|, \B\, \C\} ^ Cr{ABC)^+'. 

To see how this relates to the Mordell equation, assume that \x\^ is the 
largest of the three terms in ^. Then, roughly speaking, the ABC conjec- 
ture predicts that 

Ix^l ^ C\xyd\^+^, 

and the Hall bound follows by taking logs and noting that \y\ is approxi- 
mately |xp^. 

Hall's conjecture has been extensively tested. Table [U shows values of 
integers x and d with an integral y satisfying y^ = x^ + d, having log x large 
in comparison with 21og|(i|. It is taken from Elkies' website [9] (see also 
the paper tl4j). The table is surprising in two opposite senses: Firstly, it 
gives more examples of inordinately large solutions of simple Diophantine 
equations, and secondly, it shows that they nonetheless fall within sight of 
a reasonable constraint upon how large they could be when viewed on a 
logarithmic scale. 

Table 1. Large values of logx/21og \d\ in Hall's conjecture. 



d 


X 


logx 


logx/21og|d| 


-1641843 


5853886516781223 


36.305 


1.268 


-30032270 


38115991067861271 


38.179 


1.108 


1090 


28187351 


17.154 


1.226 


193234265 


810574762403977064 


41.236 


1.080 


17 


5234 


8.562 


1.511 


225 


720114 


13.487 


1.245 


24 


8158 


9.006 


1.417 


-307 


939787 


13.753 


1.200 


-207 


367806 


12.815 


1.201 


28024 


3790689201 


22.055 


1.076 
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Does Table [T] convince? Taken together with the imphcation of the ABC 
conjecture, the answer is probably yes, in the sense that this is evidence 
for a sensible conjecture. We have presented it because, later on, two more 
tables will appear and a direct comparison will be invited. 

4. Repellent Powers 

It might not be apparent so far, but the point at infinity is enormously 
important in understanding solutions of Mordell's equation. For example, 
the set of rational points on the curve forms a group under a natural geo- 
metric form of addition. The point at infinity is the identity element for this 
group. 

From our viewpoint, Siegel's Theorem may be interpreted to say that 
the point at infinity repels the integer points on the curve y^ = x^ + d. In 
other words, there is a (punctured) neighbourhood of the point at infinity 
free of integer points. Empirically, we might even say we observe integer 
points repelling each other. For a sophisticated instance of this repulsion 
property being used to understand integral points on elliptic curves, see the 
work of Helfgott and Venkatesh [T7] . In their work, integer points repel each 
other inside the Mordell-Weil lattice. The methods of [17J will yield explicit 
constants that might well quantify practically the rate at which integral 
points repel each other coordinate-wise. This repulsion between integral 
points will be something of a mantra throughout this paper, and will inform 
the latter part significantly. For now though, consider the idea of points 
with fixed arithmetic properties repelling each other as a kind of paradigm 
for understanding other results in Diophantine equations. 

The results stated so far may be seen as an instance of a general tendency 
for distinct integral powers to repel each other. Thus, for example. Baker's 
result may be phrased as follows. If x and y are positive integers with y^ ^ x^ 
then there is a constant C = C{x) with 

\y'^ -x^\ >C{\ogxf^~^; 

Stark's result says that for any k <1 there is a constant C = C{k) with 

|y^ -x^\ > C(logx)'', 

and Hall's conjecture says that for any e < 1/2 there is a constant C = C{e) 
with 

These are all statements about (subsequences of) the sequence 

a = (a„) = (1,4, 8, 9, 16, 25, 27, 32, 36, 49, 64, 81, 100, 121, 125, . . . ) 

of perfect powers: numbers of the form n™ with n,m G N and m ^ 2. A 
gap of one is seen early on, and the conjecture of Catalan [7\ is that 8 and 9 
are the only consecutive pair in the sequence a. The proof of this result 
followed a path that once again illustrates the slightly ambiguous way in 
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which the word effective is used. Tijdeman [35] used sharpened versions of 
Baker's theorem to find a number T with the property that any positive 
integral solution (x, y, m, n) to the Diophantine equation x™" — y^ = 1 must 
have maxjj;, y, m, n} ^ T. This meant that a proof of Catalan's conjecture 
was reduced to a finite list of possibilities to check - surely the most effective 
of effective statements. Unfortunately a by-product of the transcendence 
methods used was that the number T was enormous, leaving a finite but 
hopelessly impractical calculation to be done. The conjecture was finally 
proved by Mihailescu [20] using a mixture of methods, and eventually a 
beautiful, purely algebraic, proof was given by Bilu [3]. 

Pillai studied many properties of the sequence of perfect powers, and in 
a paper [27] of 1945 wrote 

/ take this opportunity to put in print a conjecture which I 
gave during the conference of the Indian Mathematical So- 
ciety held at Aligarh. Arrange all the powers of integers like 
squares, cubes, etc. in increasing order [...]. Let On be the nth 
member of this series [...]. Then 

liminf(a,i — a„_i) = oo. 

n— >oo 

The conjecture of Pillai is exactly equivalent to the conjecture that for 
any k ^ 1 the Diophantine equation x™ — y"^ = k has only finitely many 
solutions. This remarkable problem remains open, and is the subject of a 
recent survey by Waldschmidt [36] . 

4.1. The Gap Principle. The phenomena we are describing in this paper 
will be recognizable to workers in Diophantine equations although possibly 
under another name - the gap principle. Again, this is not formulated 
precisely anywhere that we can find. Roughly stated, it says that where 
Diophantine phenomena occur (say, as rational solutions to a Diophantine 
equation or to an inequality) subject to some reasonable constraint, they 
will respond by exhibiting measurable gaps. This data is then fed back into 
the technicalities of the argument. Strictly speaking, there is no single gap 
principle, it is more of a style of argument. We invite readers to explore the 
current literature on Diophantine equations to see where this term - or this 
phenomenon - occurs. 

We cite two examples, one old and one recent and very germane: In- 
gram [19] showed that large gaps occur between integral multiples of points 
on Mordell curves in the following sense. If ni < 712 and niP,n2P (in the 

2 

group theory sense) are both integral points, then a"i < n2 for some explicit 
constant a > 1. This is a strong repulsion property for integer points along 
a sequence (nP) and, just as with the results in [T7|, might well translate 
back into a good bound for the corresponding coordinates. An instance of 
the kind of applications Ingram obtains is that on Mordell curves of the 
form ([2]), for all large enough d with d sixth power-free (that is, not divisible 
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by a^ for any integer a ^ 2), there is at most one n ^ 3 such that nP is 
integral (see [HI Proposition 15]). 

Perhaps the earhest - and certainly a very influential - manifestation of 
a gap principle occurs in an old paper of Mumford ^5j. His result was 
about rational points lying on more complicated plane curves. Complicated 
here means not just higher degree but higher genus, a geometric measure of 
complexity - for example, Mumford's results apply to equations y^ = f{x) 
where f{x) E Q[x] has degree at least 5, and /(O) ^ 0. He showed that 
the rational solutions exhibit naturally occurring, expanding, gaps (when 
viewed in the most naive sense, so that the numerators and denominators 
grow large very quickly) . Although in one sense his result was superseded by 
Faltings' general proof that such curves contain only finitely many rational 
points, packages such as Magma [3] will now enumerate rational points on 
higher genus curves with some ease. The repelling nature of them brings 
Mumford's remark vividly to life. 

Example 4.1 (Taken from f5]). The genus 2 curve y'^ = x^ + 1025 has the 
following rational points, together with sign changes: 

,,33,(|,|^),,,.3,,(i.^).(|,?l|^). 

5. Generalizing Siegel's Theorem and Hall's Conjecture 

We will now describe a recent attempt to generalize both Siegel's Theorem 
and Hall's conjecture in one go. We need to begin by talking a little about 
rational solutions of our equations. Although it is not obvious, there are 
infinitely many rational solutions to each of our three starting equations. 
In each case, there is a way to produce them all, starting with a finite 
set of rational points. This fact, casually stated, obscures the enormous 
theoretical and practical knowledge about rational solutions (as opposed to 
integral solutions). For example, the rational points on 

y"^ = x^ + 15. (3) 

are all generated from the two points (1,4) and (|, ^), using the chord and 
tangent method of constructing new points. The essential observation is 
that the line joining any two rational points on the curve (or the tangent to 
a single rational point) intersects the curve again at a third rational point. 
For example, the line joining our two initial points meets the curve again 

at ( g-, 2y)- 

This operation, together with reflection in the x-axis, allows all rational 
points on the curve to be found — but this is not an easy result to prove. 
To help orient the reader, notice that this is an instance of MordeU's Theo- 
rem [24] , which says that the set of rational solutions of such an equation is 
always generated from a finite set of rational solutions in the same geometric 
way. 
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Indeed, the operation of joining two points with a hne, finding the third 
point of intersection, and then reflecting in the x-axis (extended by continu- 
ity to allow the original points to be identical) is a binary operation giving 
the set of rational points on the curve the structure of an abelian group (once 
the point at infinity is added), and Mordell's theorem states that this group 
is finitely generated. It must therefore have the form W y. F for some finite 
group F, and r is called the rank of the curve. The quantity r, like d in ([2]) 
or A^; in Section [71 is an important measure of the size or the complexity 
of the curve. 

To fix notation both now and for the sequel, note that if P = (x, y) is a 
rational solution of y^ = x^ + d and d is integral, then x^ and y^ have the 
same denominator. Thus the denominator of y^ must be the square of a 
cube, and that of x^ the cube of a square, so we may write 

'Ap Cp' 



with Ap, Bp, Cp ^"Lva. lowest terms. 

Siegel's Theorem makes a statement about the rational solutions P of 
the equation Bp = 1, which we may interpret as a statement about the 
solutions (x, y) under the condition that the denominator of x (or of y) is 
divisible by no primes. 

Question 2. What can be said about the points P such that Bp is divisible 
by one prime? 

In other words, what can be said about the set of rational solutions P with 
the property that Bp is a prime power? There is a name for points of this 
form. If the point P has Bp composed of primes from a set S then P called 
an S-integral point. The Siegel-Mahler Theorem predicts that for a fixed 
finite set of primes 5, there are only finitely many 5-integral points. What 
we are doing is slightly different, in fixing the size of S (typically IS"! = 1), 
but not its contents. Reynolds [28] proved (under a general form of the ABC 
conjecture) that only finitely many pure powers will occur among the Bp, 
so the first interesting case is when Bp is a prime. 

Sometimes, it is provable that only finitely many points P have Bp equal 
to a prime. Indeed, this is true of our starting equation 

y2 = x3 - 2. (4) 

To see this, note that on the curve ([4]), all the rational points can be gen- 
erated by the single point (3, 5). Since this point is the image of a rational 
point under a 3-isogeny, [111 Theorem 1.3] applies to demonstrate the finite- 
ness claim. Inspection suggests that, in truth, none of the rational points 
yield prime values Bp but, in a familiar refrain, the amount of checking of 
small cases remains, as yet, unfeasible. 

On the other hand, searching yields a large number of rational points P 
on equation ([3]) with Bp equal to a prime. The examples below, and all 
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that follow, were obtained using searches in Pari-GP [2j. What prevents us 
exhibiting many more examples is lack of space, not data. Readers interested 
in following this up should consult [12]. 

Example 5.1. The following x coordinates of rational points P on the 
curve y'^ = x^ + 15 have Bp equal to a prime: 

xiP) = 



11 

75721 

578509 
367^ ' 
349755479 
11909^ ' 



X(P) 
X(P) 

x(P) = 

^/p\ _ 556386829130869 „„j 
J-I-TJ — 176841892 ) anu 

X(P) 



176841892 
64892429414388628056900713281 
2594769767501772 



6. The condition that Bp is a prime 

Computational evidence, as well as a heuristic argument [HI 12], suggest 
that when the set of rational points is generated by more than one point, as 
in equation ([3]) for example, there will be infinitely many points P with Bp 
equal to a primqj. Although the papers [TTl |T2] contain a great deal of 
numerical evidence, as well as a reasoned heuristic argument, we must admit 
that the conjecture has not been proved, even for one curve. 

The question we now ask can be put roughly as follows. 

Question 3. Where are the rational points P with Bp a prime? 

It makes sense to set the question in a slightly wider context, even though 
our main concern lies with the case stated. 

Definition 6.1. The length of a rational point P is the number of distinct 
prime divisors of Bp. 

Thus the integral points are precisely the length zero points. Length 1 
points are those with Bp equal to a prime power. Our comments will apply 
to rational points whose length is bounded, but the question above asks 
simply for the location of the length 1 points. As discussed in Section HI 
Siegel's Theorem may be interpreted to say that the point at infinity repels 
length zero points. Is it possible that the same might be true for length 1 
points? In other words, do length 1 points have bounded x-coordinates? 

A first attack is computational. To increase the options, a search was 
made on curves y"^ = x^ + Ax + B with AA^ + 27B^ / 0, that is, on 
elliptic curves (which is the proper setting for this question for reasons we 
will expand on later). This does yield examples of length 1 points with 
inordinately large x-coordinates. 



Provided a technical assumption is met, namely, that the set of points should not lie 
in the image of a rational isogeny. 
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Example 6.2. The curve y^ = x'^ — 7x + 10 contains a rational point P 
with 

Bp = 14476032998358419473538526891666573479317742071, 

a prime, and with 

x{P) = 175567.984... . 

The x-coordinate has been expressed as a real number to emphasize its 
rough size - this is less apparent when the number is written in rational 
form with a numerator and denominator. 

6.1. Generalizing Siegel's Theorem. Does this mean that the obvious 
generalization of Siegel's Theorem is false? One might think so given Ex- 
ample 16.21 above, and others like it - but that is to miss the flow of our 
argument thus far. Drawing a parallel with our earlier comments, the ev- 
idence both from examples of Siegel's Theorem and from the constraints 
along the lines of Hall's Conjecture suggests that the right parameter to 
measure is logx/log|(i| (for a suitable notion of the parameter d adapted 
to more general curves). Viewed on this logarithmic scale, we present com- 
putational evidence (see Table [2]) to suggest that infinity does indeed repel 
length 1 points. In other words, we are suggesting that the generalization of 
Siegel's Theorem becomes more plausible when strengthened along the lines 
of Hall's conjecture. 

More even than this, an elliptic curve is fundamentally a projective object 
(in particular, the process of adding the point at infinity as the identity 
element can be formalized by viewing the curve as an elliptic object). What 
this means is that its true nature only becomes revealed when viewed as 
lying in projective space. On that basis, there are no specially favoured 
points: although the point at infinity is chosen traditionally as the identity 
for the group law, the group structure makes sense with any chosen rational 
point as identity, with the appropriate changes. Thus we are drawn, perhaps 
with some trepidation, towards a belief that all rational points (or even all 
algebraic points) repel the points with length below a fixed bound. In other 
words, around each algebraic point there is a punctured neighbourhood free 
of bounded length points. 

In order to examine this conjecture, as well as to frame it along earlier 
lines, we introduce a suitable notion of the distance between two points. 

Definition 6.3. Given an algebraic point Q, we define the logarithmic dis- 
tance from P 7^ Q to be 

hQ{P) = -\og\x{Q) - x{P)\ 

when Q is finite, and 

log|x(P)| 
when Q is the point at infinity. 
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To help swallow the looking-glass nature of Definition [631 note that points 
close to 'infinity' are considered large. For the sake of consistency, logarith- 
mic distances work the same way, so that points which are close to each 
other turn out to have a large logarithmic distance - a general feature of 
such distances in Diophantine Geometry. To see why this is natural, notice 
that a measure of the size of a rational number | in lowest terms is given 
by the height H(^) = max{|o|, |6|}. Then a rational number very close to, 
but not equal to, and specified number necessarily has very large height. 

Conjecture 2. Let k denote a fixed positive integer. Assume d is sixth 
power-free. Then the set of algebraic points repels the rational points with 
length below k (the 'Siegel part' of the conjecture). If Q denotes a fixed 
algebraic point, then there is a constant C = C{k, Q) such that 

hQ{P)^C\0g\d\ 

for any rational point P with length below k (the 'Hall part'). 

The condition on d, that it is not divisible by any sixth power, is a natural 
one. Without this condition, we would be free to multiply the equation 
through by sixth powers of integers to create more and more integral and 
length 1 points in an artificial way. The general hypothesis, of which this is 
the special case for y^ = x'^ + d, is that an elliptic curve be in minimal form. 

In the next section, computational evidence for Conjecture [2] will be pre- 
sented. Notice that some evidence for Conjecture [2] already comes from the 
data examined in the first part of the article. The conjecture predicts that 
integral points are not simply repelled by the point at infinity, but also by 
each other - exactly what we observe. Conversely, the conjecture too sheds 
light upon the data. Looked at this way, we should not be surprised that, 
when more than a couple of integral points exist, the outliers are forced to 
lie a long way out. 

7. Computational Evidence 

This was obtained in [10] for elliptic curves (see [U EI] for background) 
of the form 

E -.y^ + aixy + a^y = x^ + a2X^ + a^x + ae 
with ai, . . . , ae G Z. The role of the parameter d, measuring the complexity 
of the curve, is played here by 7^ A^; G Z[ai, ...,03], the so-called discrim,- 
inant. This is a (complicated) explicit polynomial in the variables oi, . . . , ag- 
Write He = log|A£;|. Each curve E is recorded as a list [01,02,03,04,06], 
and Table [2] shows what we believe is the maximal distance h from infinity 
of the length 1 points on that curve. 

The curves are all in minimal form and all have rank 2, and the search 
range runs over length 1 points of the form mP + nQ where P and Q 
denote generators of the torsion-free part of the group of rational points, 
and |m|,|n| ^ 150. The search range is necessarily constrained because 
checking that points have length 1 requires primality testing on some large 
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integers. In every case, the largest value h occurs fairly early, strengthening 
our belief it is truly the maximum - see [lOj for details. Conjecture[2]predicts 
that the ratio h/hE will be uniformly bounded. Readers are invited to 
compare this table of values with the one for Hall's Conjecture itself (Table[I]) 
given earlier. 

Table 2. Infinity repelling length 1 points 



E 


IAeI 


h 


h/hE 


0,0,1,-199,1092 


11022011 


12.809 


0.789 


0,0,1,-27,56] 


107163 


11.205 


0.967 


0,0,0,-28,52] 


236800 


13.429 


1.085 


1, -1, 0, -10, 16 


10700 


9.701 


1.045 


1,-1,1,-42,105 


750592 


8.136 


0.601 


0, -1, 0, -25, 61 


154368 


16.592 


1.388 


1, -1, 1, -27, 75 


816128 


12.363 


0.908 


0, 0, 0, -7, 10 


21248 


12.075 


1.211 


1, -1, 0, -4, 4 


892 


11.738 


1.727 


0, 0, 1, -13, 18 


3275 


6.511 


0.804 


0, 1, 0, -5, 4] 


4528 


7.377 


0.876 


0, 1, 1, -2, 0] 


389 


9.707 


1.627 


1, 0, 1, -12, 14 


2068 


9.819 


1.286 



Table [3] shows some curves with Q = (0,0), and the maximal distance Hq 
from Q of the length 1 points. The remarks about search ranges and our 
confidence about the true nature of the maximum apply here as before, 
see also p^. Note that points P close to Q = (0,0) yield a large values 
of Hq^P) because these are logarithmic distances. The thrust of Conjecture[2] 
is that these values, properly scaled, are not inordinately large, but satisfy 
a reasonable constraint. Once again, a direct comparison is invited between 
this table and the one given earlier in Table [TJ 



Table 3. Curves with Q = (0,0). 



E 


\^e\ 


hq 


hg/hE 


[0, 0, 0, 150, 0] 


216000000 


6.436 


0.335 


0, 0, 0, -90, 


46656000 


3.756 


0.212 


[0,0,0,-132,0 


147197952 


4.470 


0.237 


[0,1,0,-648,0 


17420977152 


0.602 


0.025 


[0,0,0,34,0] 


2515456 


2.107 


0.143 


[0,0,0,-136,0 


160989184 


0.279 


0.014 


[0,1,0,-289,0 


1546140752 


5.712 


0.269 
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8. A Theorem 

Given the speculative nature of this study, it is a httle surprising to find 
certain conditions where the Siegel part of the conjecture can be proved 
unconditionahy, and the Hall part conditionally, at least when Q is the 
point at infinity. What follows is a modest result, but it does yield examples 
where further testing may be carried out, and it provides some support for 
Conjecture [2j Recall that rational points Qi and Q2 are independent in 
the group law on the rational points of the curve if there is no point P 
with Qi = aP and Q2 = bP for integers a, b. 

Theorem 8.1 (Everest and Mahe [IDJ). Consider the equation 

y^ = x^ - Nx 

for some < iV G N. Assume that Qi and Q2 are independent rational 
points with x{Qi) < and with x{Q2) equal to a square. 

(a) ('Siegel part') There is a bound upon \x{P)\ as P runs over length 1 
points in the group generated by Qi,Q2- 

(b) ('Hall part') AssuTue additionally that the ABC conjecture holds 
in Tj. Then there is a constant C, independent of N , for which 

log|x(P)| ^ClogA^, 

for all length 1 rational points P. 

Example 8.2. The points Qi = [-9,9],Q2 = [49/4, -217/8] on the curve 

satisfy the hypotheses of Theorem l8.11 Computations support the belief that 
infinitely many length 1 points lie in the group generated by Qi and Q2- 

Example 8.3. The points Qi = [-9, 120],Q2 = [841,24360] on the curve 

y^ = x^ - 1681x 

satisfy the hypotheses of Theorem 18.11 Note that x{Q2) = 29^. Also, in 
this case Qi and Q2 are generators for the torsion-free part of the group 
of rational points. As before, it seems likely that infinitely many length 1 
points lie in the group generated by Qi and Q2- 
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